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Abstract 

For positive definite matrices A and B, the Araki-Lieb-Thirring inequality amounts 
to an eigenvalue log-submajorisation relation for fractional powers 

X(A t B t ) -< w(log) \\AB), < t < 1, 

while for t>l, the reversed inequality holds. In this paper I generalise this inequal- 
ity, replacing the fractional powers x t by a larger class of functions. Namely, a contin- 
uous, non-negative, geometrically concave function / with domain dom(/) = [0,xo) 
for some positive xq (possibly infinity) satisfies 

X(f(A)f(B)) ^ (log) f\\ l '\AB)\ 

for all positive semidefinite A and B with spectrum in dom(/), if and only if < 
xf'{x) < f(x) for all x 6 dom(/). The reversed inequality holds for continuous, non- 
negative, geometrically convex functions if and only if they satisfy xf'(x) > f(x) 
for all x G dom(/). As an application I derive a complementary inequality to the 
Golden-Thompson inequality. 
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1 Introduction 



The Araki-Lieb-Thirring (ALT) inequality [2117] states that for < t < 1 
and positive definite matrices A and B, the eigenvalues of A t B t are log- 
submajorised by the eigenvalues of (AB) 1 . For n x n matrices X and Y with 
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positive spectrum, the log-submajorisation relation X(X) -< w (\og) X(Y) means 
that for all k = 1, . . . , n, the following holds: 

k k 
3=1 3=1 

This is equivalent to weak majorisation of the logarithms of the spectra 

logA(X) -< w logA(y). 

Here and elsewhere, I adhere to the convention to sort eigenvalues in non- 
increasing order; that is, X\{X) > X 2 (X) > . . . > X n (X). 

With this notation, the ALT inequality can be written as 

XiA'B 1 ) -< w(log) \\AB), < t < 1. (1) 

For positive scalars a and 6, ((Tj) reduces to the equality a*6* = (a&)*. 

One can ask whether similar inequalities hold for other functions than the 
fractional powers x f . One possibility is to consider functions that satisfy 

X(f(A)f(B)) ^ (log) f(X(AB)). 

As the scalar case reduces to f(a)f(b) < f(ab) these functions must be super- 
multiplicative. Another possibility, and the one pursued here, is to consider 
functions satisfying 

X(f(A)f(B)) ^ (log) f (yJ\{AB)) . (2) 

Here, the scalar case reduces to f(a)f(b) < f 2 (y/ab), for all a, b > 0. Functions 
satisfying this requirement are called geometrically concave (see Definition 
[1] below). In this paper I completely characterise the class of geometrically 
concave functions that satisfy (j2J) for all positive definite matrices A and B. 

Likewise, as inequality ([1]) holds in the reversed sense for t > 1, one may ask 
for which functions / the reversed inequality holds for all positive definite 
matrices A and B: 

f (v/A(AB)) ^ (log) X(f(A)f(B)). (3) 

Here the scalar case restricts the class of functions to those satisfying the 
relation f 2 {\fab) < f(a)f(b). Such functions are called geometrically convex. 
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I also completely characterise the class of geometrically convex functions that 
satisfy ([3]) for all positive definite matrices A and B. 

The concepts of geometric concavity and geometric convexity were first studied 
by Montel [5] and have recently received attention from the matrix community 
[515] . 

Definition 1 Let I be the interval I = [0,xq), with xq > (possibly infi- 
nite). A function f : I — >■ [0, oo) is geometrically concave if for all x,y G 

I, y f(x)f(y) < f(\/zy)- It is geometrically convex if for all x,y G /, 

Jf(x)f(y)>f(^xy). 

Equivalently, a function f(x) is geometrically concave (convex) if and only if 
the associated function F(y) := log(/(e y )) is concave (convex). 

The main results of this paper are summarised in the next section, the proofs 
of the main theorems (Theorems [1] and [2]) are given in Section [3], and the paper 
concludes with a brief application in Section HI 



2 Main Results 

To state the main results of this paper most succinctly, let us define two classes 
of functions. 

Definition 2 A continuous non-negative function f with domain an interval 
I = [0, x ) is in class A if and only if it is geometrically concave and its 
derivative f satisfies < xf'(x) < f(x) for all x G / where the derivative 
exists. 

Definition 3 A continuous non-negative function f with domain an inter- 
val I = [0, xo) is in class B if and only if it is geometrically convex and its 
derivative f satisfies xf'(x) > f(x) for all x E I where the derivative exists. 

In terms of the associated function F(y) = log(/(exp?/)), / G A if and only 
if F(y) is concave and < F'(y) < 1 for all y where F is differentiable, 
and / G B if and only if F(y) is convex and 1 < F'(y) for all y where F is 
differentiable. 

There is a simple one-to-one relationship between these two classes; essentially 
/ is in class A if and only if its inverse function f~ l is in class B. However, 
some care must be taken as A contains the constant functions and also those 
functions that are constant on some interval. 
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Proposition 1 A function f that is non-constant on the interval [0,xi] C 
[0, Xq) is in class A if and only if the inverse of the restriction of f to [0, X\\ 
is in class B. 

Proof. It is clear that a concave monotonous function / is always invertible 
over the entire interval where it is not constant. We will henceforth identify 
the inverse of / with the inverse of the restriction of / on that interval. 

If F(y) is the associated function of f(x) then the associated function of f^ 1 is 
the inverse function of F, F' 1 . Now / is in class A if and only if F is concave, 
monotonous and F' < 1. This implies that the inverse function G = F~ l is 
convex and satisfies G' > 1, which in turn implies that G is the associated 
function of a function g in class B. This shows that / G A implies / _1 G B. 

A similar argument reveals that the converse statement holds as well. □ 

The main result of this paper is the following theorem: 

Theorem 1 Let f be a continuous non-negative function with domain an 
interval I = [0,Xq), xq > (possibly infinite), then 



holds for all positive definite matrices A and B with spectrum in I if and only 
if f is in class A. 

That the right-hand side of (jlj) is well-defined follows from the following 
lemma: 

Lemma 1 If A and B are positive semidefinite matrices with eigenvalues in 
the interval I = [a, b], < a < b, the positive square roots of the eigenvalues 
of AB are in I as well. 

Proof. We have a < A, B < b, which implies 



Hence, a 2 < Xi(A 1/2 BA 1/2 ) < b 2 , so that a < \] /2 (AB) < b. □ 

A simple consequence of Theorem [T] is that the reversed inequality holds if 
and only if / is in class B. 

Theorem 2 Let g be a continuous non-negative function with domain an in- 
terval I = [0,xq), xq > (possibly infinite), then 




(4) 



a 2 <aA< A^BA 1 ' 2 < bA < b 2 . 




(5) 
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holds for all positive definite matrices X and Y with spectrum in I if and only 
if g is in class B. 



3 Proofs 



We now turn to the proofs of Theorems [T] and [2J 



3. 1 Proof of necessity 



To show necessity of the conditions / e A (/ G B) I consider two special 2x2 
matrices with eigenvalues a and b, < b < a, such that a e dom(/) and / is 
differentiable in a: 



A 



/ a 
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B = C/ £T, with [/ 



cos — sin 6 1 



sin 9 cos # 



We will consider values of 9 close to 0. The largest eigenvalue of AB can be 



calculated in a straight-forward fashion. The quantity / 2 (y Ai (AB)) can then 
be expanded in a power series of the variable 9. To second order this yields 

In a similar way we also get 

x 1 (f(A)f(B)) = m 2 - f{a) ]\^tn^ )) e2 + 0{e% 

Hence, to satisfy inequality (j4]), the following must be satisfied for all < b < 
a E dom(/): 

a(a-b)f(a)f'(a) /(a) 2 (/(a) - /(6)) 
a + b ~ f(a) + f(b) ' 

In particular, take b = 0. As / has to be geometrically concave, /(0) = 0. The 
condition then becomes 

af(a)f\a)<f(a) 2 ,\faedom(f), 

which reduces to the defining condition for / e A. 

Necessity of the condition xf'(x) > f(x) in Corollary |2] also follows immedi- 
ately from this special pair of matrices. 
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Note that in the preceding proof we see why the domain of / should include 
the point x = 0. 



3.2 Proof of sufficiency for Theorem^ 

Now I turn to proving sufficiency of the condition / G A. The main step 
consists in showing that the set of functions / for which the inequality (j3J) 
holds is 'geometrically convex'; that is, the set of associated functions F for 
these / is convex. To show this a number of preliminary propositions are 
needed. 

Lemma 2 Let R\, R 2 , S\ and S 2 be positive semidefinite matrices such that 
Ri commutes with R 2 and Si with S 2 . Let R = R± 2 R^ 2 and S = S\' 2 S 2 . 
Then 

X 1 {{R l ' 2 SR 1 / 2 ) 2 ) < \ 1 (r{ /2 S 1 r{ /2 Rl /2 S 2 Rl /2 ). (6) 
Proof. We will prove this by showing that the equality 

Xi{r\ /2 SiR\ /2 rI /2 s 2 rI /2 ) = a 

implies the inequality Xi((R 1/2 SR 1/2 ) 2 ) < a. 

W.l.o.g. we can assume that the matrices Ri and S\ are invertible; then the 
equality indeed leads to the following sequence of implications: 



X 1 {R\ ,2 S x R\ /2 R\ 12 S 2 R\ 12 ) = a 
^(R\ /2 S 1 R 1 1 / Y 2 r 1 2 I2 s 2 r\ /2 (R 1 1 /2 S 1 Rl / Y 2 < a 
^Ry 2 S 2 Ry 2 <aR- x l ' 2 S^K- x 1 ' 2 

Q l/2„l/2 nl/2 Q pV2 pl/2 Q l/2 _ Q l/2 p Q p Q l/2 
=? Oi JrCi rt 2 o 2 ri 2 iti Oi — Oi -tto 2 rtOi ^ ft 

^ ai{Sl' 2 RS\ 12 ) < yfc 

^\Xi{S x 2 l2 RS X i l2 )\<^Ta. 

The last implication is the simplest case of Weyl's majorant theorem. 

Now note that S 2 /2 'RS{' 2 and S l ' 2 RS 1 / 2 = Sl^S^RS^S^ 4 have the same 
non-zero eigenvalues. Hence, X^S^RS 1 ' 2 ) = X^R^SR 1 / 2 ) < y/a, and the 
inequality X^R^SR 1 / 2 ) 2 ) < a follows. □ 
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Proposition 2 Under the conditions of Lemma\^ 

n hr\ I2 r\' 2 s\ /2 si 12 ) < n ^{RxSx) \] I2 {s 2 r 2 ). (7) 

1=1 i=l 



Proof. Since each side of ([SD is the largest eigenvalue of a product of powers of 
matrices, we can use the well-known Weyl trick of replacing every matrix by its 
antisymmetric tensor power to boost the inequality to the log-submajorisation 
relation 



,J 2 C n \ 2 ?;) J 2^-^1/2^ 

i=l i=l 



]JH(R 1/2 SR 1/2 ) 2 ) < UHRl /2 SiR{ /2 r\ I2 s 2 r\ 



Combining this with Lidskii's inequality \[ k i=1 Xi(AB) < Ui=i Xi(A)Xi(B) ([1], 
Corollary III. 4. 6), valid for positive definite A and B, and then taking square 
roots yields inequality ([7]). □ 

Inequality ([7]) can be interpreted as midpoint geometric convexity of the func- 
tion 

h{p) = ilHRlR\- p SfSl-*); 

i=l 



that is, /fc(l/2) < J fk(l)fk(0). We now use a standard argument (see e.g. 
the proof of Lemma IX. 6. 2 in jl]) to show that this actually implies geometric 
convexity in full generality, i.e. fkij>) < fk(X) v fk(fy 1 ~ p for all p G [0, 1]. 

Proposition 3 Under the conditions of Lemma\^ and for all p G [0, 1], 

f[ HR p iRl' p s p s^ p ) < n WRiSi) \}- p (s 2 R 2 ). (8) 



i=l i=l 



Proof. By Proposition [2] the inequality holds for p = 1/2. It trivially holds for 
p = and p = 1 . 

Let s,t G [0, 1] be given. Applying Proposition [2] with the matrices Ri, Si, R 2 
and £2 replaced by i?^ - *, ^5 , 2 1 "*, R\R\~ S and S{Sl~\ respectively, yields 
the inequality 



_ f ?(*+*)/2 z? l-(s+t)/2 Q {s+t)/2 Q l-(s+t)/2^ 

1 1 

k 

^ 1/2 1 jyt jyl—tQtQl — t\ \ 1/2 1 ps pl-s q1 — s\ 

i=l 



Y[ \i(Ri +t)l2 R)r {s+t)l2 s[ s+t)/2 si 

k 



i=l 

Now assume that the inequality ([HD holds for the values p = s and p = t. Thus 
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<f[Af (it^) Aj^Cftife) Af (fl^i) tf-'^iSiRa) 
i=i 

= flAr )/2 (i? 1 5 1 ) Aj- ( ^ )/2 (5 2J R 2 ). 
j=i 

In other words, the assumption that (jSJ) holds for the values p = s and p = t 
implies that it also holds for their midpoint p = (s + 1)/2. 

Using induction this shows that (jSJ) holds for all dyadic rational values of p 
(i.e. rationals of the form k/2 n , with k and n integers such that k < 2 n ). 
Invoking continuity and the fact that the dyadic rationals are dense in [0, 1], 
this finally implies that (jSJ) holds for all real values of p in [0, 1]. □ 

We are now ready to prove our first intermediate result: convexity of the set 
of associated functions F for which the inequality (jl]) holds. 

Proposition 4 Let fi(x) and f2(x) be two continuous, non-negative functions 
with domain an interval I of the non-negative reals, and for which holds 
for all positive semide finite A and B with spectrum in I . Let p G [0, 1] and let 
f(x) = f\{x)fl~ p {x). Then (^Jj holds for f too. 

Proof. Let us fix the matrices A and B and let Ri = fi(A) and Si = fi(B), 
i = 1,2. These matrices Ri and Si clearly satisfy the conditions of Proposition 
[3] (positivity and commutativity). Hence 

n Hf(A) f(B)) = n HmA)ft p (A) fmti^m 

8=1 8=1 
fc 

<Iimi(A)h(B))\}- p (f 2 (A)f 2 (B)). 

8=1 

By the assumption that fi and f 2 satisfy inequality PJ, this implies 
II W(A) f(B)) < fi f?{\] /2 {AB)) f 2 2 {1 - p \\] /2 (AB)) 

8=1 8=1 

= f[f(Xl /2 (AB)) 1 

8=1 

i.e. / satisfies inequality (jl]) as well. □ 
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We have already proven that membership of this class is a necessary condition 
for inequality (jJJ) to hold. The set of associated functions F for functions in 
class A is the set of concave functions F that satisfy < F'(y) < 1 for all 
y in the domain of F where F is different iable. This set is convex, as can 
be seen from the fact that, for f € A, F' is non- in creasing and the range 
of F' is [0, 1]. Hence, F' is a convex combination of step functions $(& — y) 
(with $ the Heaviside step function) and the constant functions and 1: 
F'iy) = r + s J(_ 00)+00 ) $(& — y)dfj,(b), where r, s > 0, r + s < 1, and <i/i is a 
probability measure (normalised positive measure). Hence, such F have the 
integral representation 

F(y) = a + ry + s J min(y, t) d/J,(t). (9) 

(— oo,+oo) 

The additive constant a corresponds to multiplication of / by e a , so we may 
assume that a = 0. Since r + s < 1 it then follows that / is in the geometric 
convex closure of f(x) = 1, f(x) = x and f(x) = min(x, c) for c £ I (c = e*). 

The next step of the proof is to show that inequality (j4]) holds for these 
extremal functions. For the functions f(x) = 1 and f(x) = x this is of course 
trivial to prove. Hence let us consider the remaining function f(x) = min(x, c), 
with c 6 /. As the constant c can be absorbed in the matrices A and B, we 
only need to check the function f(x) = min(x, 1). The action of this function 
on a matrix A is to replace any eigenvalue of A that is bigger than 1 by the 
value 1. 1 denote this matrix function by min(A, 1). For this function a stronger 
inequality can be proven than what is actually needed. 

Lemma 3 For A,B>0, and for any i 

Ai(min(A, 1) min( J B, 1)) < min(Ai(AB), 1). 

Proof. Let A 1 = min(A, 1) and B 1 = min(i?, 1). We have A 1 < A and B x < B, 
so that, using Weyl monotonicity of the eigenvalues twice, 

\ t (A 1 B 1 ) = \ l (A{ /2 B 1 A{ /2 ) 

< \i(A\ /2 BA\ /2 ) 
= \ l {B l l 2 A 1 B 1 ' 2 ) 

< \i(B l/2 AB l/2 ) 
= Xi(AB). 

This implies also that min(A i (A 1 _B 1 ), 1) < min(Aj(A5), 1). 
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We also have A\ < I and B\ < I, hence by Lemma [T] min(Aj(Ai5i), 1) = 
\{A 1 B 1 ). □ 

Since the inequality of this lemma implies the weaker log-submajorisation 
inequality 

A(min(A, 1) mm(B, 1)) -< w(log) min(A(AB), 1), 
all extremal points of the class A satisfy the inequality (j4j). 

Finally, by Proposition H]this implies that fll]) holds for all functions in A, hence 
membership of A is a sufficient condition. This ends the proof of Theorem 

m □ 

3. 3 Proof of sufficiency for Theorem 

Let A = g(X) and B = g(Y), with g = f' 1 . Thus, X = f(A) and Y = f(B). 
Since / is in A, g is in B. Inequality (jlj then gives 

X(XY) ^ (log) f 2 (y/\(g(X)g(Y)j) . (10) 

The right-hand side features the function w(x) = f 2 (y/x). Because / is geo- 
metrically concave, so is w. The inverse function w~ l is given by w^ 1 {y) = 
g 2 (^/y). Therefore, w~ l is geometrically convex. Furthermore, because /' is 
non-negative, w' 1 is monotonously increasing. 

A monotonous convex function preserves the weak majorisation relation ([1], 
Corollary II. 3. 4). Thus, a monotonous geometrically convex function preserves 
the log-submajorisation relation. Hence, when w -1 is applied to both sides of 
ffTUl) one obtains 

w-\X(XY)) ^ (log) \{g{X)g(Y)), 

which is ©. □ 



4 Application 

An interesting application concerns the function f(x) = 1 — exp(— x), which 
is in class A. A simple application of Theorem [1] leads to an inequality that is 
complementary to the famous Golden-Thompson inequality Trexp(A + B) < 
Trexp(A) exp(-B) (where A and B are Hermitian). 

In [6] (see also []]) the inequality 

Ti(exp{pA)#ex-p(pB)) 2/p < Trexp(A + B) 
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was proven, for every p > and Hermitian A and B. This is complementary 
to the Golden-Thompson inequality because it provides a lower bound on 
Trexp(A + B). The bound obtained below is complementary in a different 
sense, as it provides an upper bound on Tre~ A e~ B (for positive A and B). 

Theorem 3 For A,B>0, and with C = (A 1/2 BA 1/2 ) 1/2 , 

Tr(e- A e- B ) < Tr(e~ A + e~ B ) + Tr(e~ 2C - 2e~ c ). (11) 

Proof. The inequality can be rewritten as Tr f (A) f(B) < Tr/ 2 (C), with 
fix) = 1 — e~ x . By Lemma 5 in [3 J f(x) is geometrically concave. More- 
over, / is in A: obviously, /' > 0; secondly, f'(x) = exp(— x) < 1/(1 + x), so 
that xf'(x) = xexp(— x) < 1 — exp(— x) = f(x). 

Hence / satisfies the conditions of Theorem [TJ The inequality follows imme- 
diately from that theorem, as log-submajorisation implies weak majorisation, 
and majorisation of the trace, in particular. □ 
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